A Scalable Communication-Induced Checkpointing Algorithm for Distributed Systems
نویسندگان
چکیده
منابع مشابه
Efficient Checkpointing Algorithm for Distributed Systems Implementing Reliable Communication Channels
This paper presents a new checkpointing algorithm for systems using reliable communication channels. The new algorithm requires O(n + m) communication messages, where n is the number of participating processes, and m is the number of late messages. The algorithm is non-blocking, requires minimal message logging, and has minimal stable storage requirements. This algorithm is also scalable, si...
متن کاملA Non-blocking Checkpointing Algorithm for Distributed Systems
The technology of checkpointing and rollback recovery as an effective method of fault tolerance, has been used widely on the parallel or distributed computer systems. We have presented a nonblocking coordinated checkpointing algorithm for distributed systems, which are differ from the conventional approach of taking first temporary checkpoints and then converting them to permanent ones by proce...
متن کاملAn Efficient Checkpointing Algorithm for Distributed Systems Implementing Reliable Communication Channels
This paper presents a new checkpointing algorithm that guarantees the semantics of reliable communication channels despite the crash and recovery of processes. This algorithm requires O(n + m) communication messages, where n is the number of participating processes, and m is the number of late messages. The algorithm is nonblocking, requires minimal message logging, and has minimal stable st...
متن کاملAn Index-Based Checkpointing Algorithm for Autonomous Distributed Systems
This paper presents an index based checkpointing algorithm for distributed systems with the aim of reducing the total number of checkpoints while ensuring that each checkpoint belongs to at least one consistent global checkpoint or recovery line The algorithm is based on an equivalence relation de ned between pairs of successive checkpoints of a process which allows in some cases to advance the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Transactions on Information and Systems
سال: 2013
ISSN: 0916-8532,1745-1361
DOI: 10.1587/transinf.e96.d.886